Automatic generation of in-text hyperlinks in web publishing

نویسنده

  • Tomaž Šolc
چکیده

We present a method for automatic generation of in-text explanatory hyperlinks for use in web publishing. A system using this method is currently in production as part of a service for enriching plaintext content. We recognize the importance of link anchors in practical use of such systems, therefore the method is centered around link anchor selection and uses semantic similarity only to resolve ambiguities in the language. We use English Wikipedia as the training set which allows us to capture the current cultural knowledge. Using structured information extracted from Wikipedia we can provide explanatory links to articles in Wikipedia, book and movie databases and other pages on the Internet.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Professional electronic publishing in Hyper-G: The next generation publishing solution on the Web

The rst part of the paper identi es disadvantages of rst generation Web publishing solutions that have to be overcome for professional publication providers. Using Hyper-G for distribution of electronic documents opens the way to the rst fullyintegrated professional publishing solution on the Web. User and group access rights as well as billing mechanisms are integrated into the server, links a...

متن کامل

Improvement of generative adversarial networks for automatic text-to-image generation

This research is related to the use of deep learning tools and image processing technology in the automatic generation of images from text. Previous researches have used one sentence to produce images. In this research, a memory-based hierarchical model is presented that uses three different descriptions that are presented in the form of sentences to produce and improve the image. The proposed ...

متن کامل

A survey on Automatic Text Summarization

Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...

متن کامل

Systematic literature review of fuzzy logic based text summarization

Information Overloadrq  is not a new term but with the massive development in technology which enables anytime, anywhere, easy and unlimited access; participation & publishing of information has consequently escalated its impact. Assisting userslq    informational searches with reduced reading surfing time by extracting and evaluating accurate, authentic & relevant information are the primary c...

متن کامل

A text mining approach for automatic construction of hypertexts

The research on automatic hypertext construction emerges rapidly in the last decade because there exists a urgent need to translate the gigantic amount of legacy documents into web pages. Unlike traditional ‘flat’ texts, a hypertext contains a number of navigational hyperlinks that point to some related hypertexts or locations of the same hypertext. Traditionally, these hyperlinks were construc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008